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Introduction. Seif-efficacy refers to a person's belief that he or she can carry out a 
^ particular behavior. Not long ago, Bandura (1977a) proposed self-efficacy as a good 
^ explanation for behavior change during psychotherapy. In the past decade, 

self— efficacy has traveled well: #Not only is there increasing interest in the construct, 
Qs but it is now attached to a sweep of behaviors far beyond its origins in anxiety and 

OJ phobia. A quick check of Psychological Abstracts shows connections between 

O self— efficacy and sports skills, academic perfornfiance, health practices, and 

t j J socialization behaviors. Self— efficacy now occupies a centre) role in Bandura's (1986) 

social learning theory. 

There is growing^ sizable body of research suggesting that efficacy 
expectations influence motivation, persistence, and accomplishment* Nearly all 
studies, however, have focused on the content of self-efficacy rather than its 
measurement. Sometimes the measurements are so casual or confounded that one 
wonders what is actually being assessed. The present study concentrates on the 
development and estimation of measurement properties of a college-level academic 
self— efficacy scale. 

Instrument Background. Bandura (1984) has stressed specificity in his self-efficacy 
scales, and items are usually arranged hierarchically [e.g., lifting 10 pounds, lifting 20 
pounds, etc.). The measurement often follows a two-step approach: First, a 
subject notes whether or not he czn perform behavior A; then he indicates the 
"strength" of efficacy on a 90-point certainty scale. (Although Eastman and 
Marziliier [1984] have complained about using a 90 — rather than 100 — point scale, 
the practical difference is tiny.) The reason for the first step is to assess the 
"magnitude" of self-efficacy by summing the number of *c3n-do" assertions. An 
easier approach to measuring magnitude might be to indicate the level of task in the 
hierarchy where a person first claims "cannot-do." On the other hand, any 
measurement of magnitude may be flawed if "can-do" is modified by weak certainty 
in the second step. 

Strength and magnitude sound like different aspects of self-efficacy, and 
theorists such as Cervone (1987) continue to promote their conceptual differences. A 
strictly empirical approach is to ask whether the two measures give unique and useful 
information about expectations or behavior. The small evidence in the literature 
shows that the two scales show substantial overlap, in their measure of academic 
self- efficacy. Wood and Locke (1987) showed correlations of .64 to .67 between 
strength and magnitude estimates. And these are spuriously low: the magnitude 
■scales suffered from a severe ceiling effect (and thus restriction of range)" (p. 
^ 1015). Wood and Locke's magnitude scales inconsistently explained unique variation 

^ in predicting academic goals and course performance, but the contribution was so 

small (averaging 2% of the criterion variance) that it has slim practical worth. 
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Interestingly, Wood and Locke concluded that the Inconsistent results from magnitude 
measurement provide good reason to use "both (strength and magnitude] measures 
when assessing self-efficacy" {p. 1023). 

Lee (1984) found a correlation of .83 between strength and magnitude 
scales of assertiveness.f?chtev> OSST) found a correlation of .95 between the measures 
and decided to drop the "strength" data from the rest of their research. In 
summary, strength and magnitude of efficacy beliefs are redundant, and if the analyst 
uses regression procedures, he invites garbled results because of multicollinear 
variables. 

Further difficulties emerge because Bandura's Guttman-type hierarchical 
scales give spurious internal consistency estimates and factorial validities. 
Correlations approaching 1.00 between an item and prece^iing (easier) items seem 
guaranteed. Also, applying a hierarchical scale to cognitive behaviors may confuse 
the respondent. Wood and Locke (1987) developed a college academic self-efficacy 
measure in which they asked students about six behaviors, repeated across various 
percentages (termed "proportions" by Wood and Locke): 

1. Memorize 60% of the facts ic concepts [covered in this course] 

2. Memorize 70% of the facts St concepts 

3. Memorize 80% of the facts it concepts 

etc. How are students able to distinguish among these levels? What 
frame of reference do students use for the 100% universe of course facts and 
concepts? 

Other researchers have developed their own styles of measuring 
self-efficacy. Most of these meaijures are self-report, Likert-type scales. Reliability 
estimates are infrequently offered, and one wonders about the likely reliability of 
one-item (Newman &. Goldfried, 1987) or 3-item scales (Maddux et ai, 1980). In a 
later paper, Maddux et ai, (1986) give a modest alpha estimate for their 3-item 
self-efficacy scale: .68. (In passing, we should acknowledge that some self-efficacy 
scales demonstrate good reliability [and validity] properties. Perhaps the most 
carefully developed and thoroughly studied is Wheeler and Ladd's [1982] Children's 
Self-Efficacy for Peer Interaction Scale [see also Perry at ai, 1986: Ladd it Price. 
1986]). 

Although Bandura (1986) has repeatedly stressed the difference between 
performance expectations (self-efficacy beliefs) and outcome expectations, some 
researchers disregard the distinction and build complex items that cover both beliefs. 
For example, "When a student gets a better grade than he usually gets, it is usually 
because I found better ways of leaching that student" (Gibson U Dembo, 1984). 
Others simply Ignore the idea that self-efficacy beliefs refer to relatively 
circumscribed behaviors and develop massive questions: "How confident are you?", 
■How persistent are you?" (Englert L Tomlinson-Keasey, 1986); "If anyone can find 
the answer to what is troubling my baby, I am the one" (Cutrona it Troutman, 
1986); "I feel that I am making a significant difference in the lives of my students" 
(Hoover-Dempsey et ai, 1987); "If parents would do more with their children, I 
could do more" (Gibson it Dembo, 1984). Stii! others use Items that are obviously 
outcome beliefs and choose to call them self-efficacy questions (e.g., Ashton it 
Webb, 1986). 

Through the haze of available measurements, many of them offering weak 
or unknown psychometric properties, it seems unlikely that self-efficacy can penetrate 
clearly. For researchers who invest considerable energy in a project, it is tempting to 
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1. GPA 

2. CASES 



.40 
.78 



127 



enjoyment 



1. GPA 

2. CASES 



.38 
.72 



79 



In short, academic self-efficacy showed very strong incremental validity beyond that 
explained by GPA alone. 

In a variation of these concurrent validity studies, the two samples were 
combined, and course grade was regressed hierarchically on GPA, then CASES score. 
The addition of CASES increased R from .62 to .81. 

To study factorial validity, responses from three samples were combined, 
and an exploratory principal factor analysis was performed. Three factors with 
eigenvalues above 1.0, explaining 78% of the systematic item variance, were sent to 
an orthogonal rotation. A clear and easily interpretable structure emerged. The 
three factors were called (1) Overt, Social Situations {e.g., participating in a class 
discussion), (2) Cognitive Operations (e.g., listening carefully during a lecture on a 
difficult topic), and (3) Technical Skills {e.g., using a computer). 

Finally, a new sampleOi = 122) was asked to estimate the difficulty of 
performing each of the behaviors listed in CASES for most students. Mean item 
difficulties were calculated and inspected for general confirmation of self-efficacy 
theory. We found that the behaviors rated least difficult were ones that students 
probably had had the most experience — and success — with (examples: attending 
classes regularly, performing simple math computations). And the behaviors rated 
most difficult were those students rarely try, rarely succeed at, and rarely observe 
models succeeding at (examples: challenging a professor's opinion in class, answering 
a question in a large class). This intuitive analysis seems to confirm predictions 
made by self-efficacy theory (Bandura, 1986). 

Educational Importance. The preliminary measurement properties of 
CASES are encouraging. In addition, CASES is a rapid (5 minutes) and simple tool 
for assessing academic self-efficacy. We are now testing a very practical application 
of the instrument. Because those with low academic self-efficacy may be at risk for 
probation or dropping out, selected students are targeted for academic counseling to 
prevent failure. In fact, certain CASES items are specific enough to give explicit 
diagnostic direction (for example. Spreading studying out instead of cramming). 

Additional validity studies are being prepared. Objective criterion 
measures — as opposed to self-report — are being collected. We are studying how 
to modify the scale to accommodate students majoring in unusual areas, such as art 
or music. We are also planning a study of how social desirabilty may influence 
self-report efficacy scores. Finally, we are investigating whether item weighting may 
improve CASES' measurement properties. 
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How much confidence do you have about doing each of the t3ehaviors 
beJow? Circle the letters that best represent your beliefs. 



A B C D E 

very < > quite 

little CONFIDENCE lot 

A B C D E i. Taking well organized notes during a lecture. 

A B C D E 2. Participating in a class discussion, 

A B C D E 3. Answering a question in a large class. 

Figure 1. Example items from CASES. 
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